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12.1 Introduction to Threshold- 


Dependent Gene Drives 


Gene drives have been proposed as valuable 
tools in the fight against a range of globally 
important issues, including vectors of dis- 
ease, invasive species and agricultural pests. 
These approaches are classified primarily 
based on their persistence and/or inva- 
siveness. Here we consider persistent (i.e. 
self-sustaining) and low-invasiveness 
(i.e. threshold-dependent) approaches using 
engineered underdominance as a case study. 

Gene drive is a phenomenon whereby a 
particular gene (or suite of genes) can bias 
inheritance in its (their) own favour, thus 
allowing it to increase in frequency over suc- 
cessive generations, even when deleterious to 
carrier individuals (Sinkins and Gould, 2006; 
Alphey, 2014, 2020; Champer et al., 2016; 
NASEM, 2016; Leftwich et al., 2018;) (see Ra- 
ban and Akbari, Chapter 8; Champer, Chapter 9, 
this volume). This may occur by a range 
of natural or synthetic mechanisms, most 
of which act in one of two ways: (i) conversion 
of progeny individuals into other genotypes; 
or (ii) reducing the fitness (or killing) of pro- 
geny individuals of certain genotypes. 


The precise configuration of gene drive 
components can lead to systems with a wide 
range of different behavioural characteris- 
tics, and these are often used to classify the 
various gene drive systems. Perhaps the 
most common classifications are based on 
their intended purpose (usually population 
modification or suppression), invasiveness 
(ability to spread into non-target popula- 
tions) and persistence (whether they remain 
in the population or diminish over time). 

Owing mainly to coverage in popular 
media outlets, the term gene drive is often 
associated with only widely known systems 
such as some CRISPR-based homing ap- 
proaches. These are expected to have rela- 
tively straightforward behaviour in that 
they are highly invasive (spreading from ex- 
tremely small releases and so likely also to 
spread to all populations linked by any de- 
gree of gene flow), highly persistent (at least 
in absence of resistance) and able to be used 
flexibly for either population suppression or 
modification (see Bottino-Rojas and James, 
Chapter 11, this volume). As discussed 
previously (James, 2005; NASEM, 2016; 
Harvey-Samuel et al., 2019; Long et al., 
2020; Lanzaro et al., 2021), the first gene 
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drive trials are likely to be conducted in re- 
mote areas such as highly isolated islands to 
limit the probability of spill-over into 
non-target populations. It is debatable 
whether geographical containment of this 
type is adequate for non-localized (some- 
times also referred to as ‘global’) gene drive 
approaches, as it would be difficult to guar- 
antee perfect confinement within the trial 
area. Another potential drawback is that any 
gene drive releases would presumably need 
to gain regulatory approval from all affected 
countries, which for global systems could be 
argued to be all countries in which the target 
species (and any capable of forming fertile 
hybrids) are present (see Beech et al., Chap- 
ter 25, this volume). For the first gene drive 
trial releases, such widespread regulatory 
approval required for a global system would 
seem challenging to obtain, at least for 
widely distributed species. Localized gene 
drives may need approval only in the target 
territory and may be particularly suitable 
where homogenous modification of every 
population of the species is not desired. 

Though not as widely discussed in the 
media as non-localized drives, several designs 
for localized drives have been proposed and 
subject to considerable analysis. Here we focus 
on two-locus engineered underdominance 
(UD) - an example of threshold-dependent 
gene drive that should be persistent, revers- 
ible and spatially restricted. This would ap- 
pear to answer much of the concern around 
non-localized gene drive approaches, since 
the system is unlikely to spread significantly 
beyond any initial trial site and can be re- 
versed easily in the event of unintended con- 
sequences — features likely to prove desirable 
to regulatory bodies and other stakeholders 
in the context of initial proof-of-concept 
gene drive field trials. 

In this chapter, we outline a range of 
modelling approaches that have been used 
to demonstrate the key characteristics of 
this approach, including threshold introduc- 
tion frequencies, reversibility, spatial limita- 
tion and robustness to mutation/resistance. 
We then go on to discuss alternative config- 
urations based on the use of sex-specific 
components and their effect on introduction 
thresholds. We conclude with a discussion 
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on the cycle of information between math- 
ematical models and experimental data along 
with a range of areas for future modelling 
that will be important in providing informa- 
tion on the anticipated effects of these sys- 
tems when released into target populations. 


12.2 Two-Locus Engineered 
Underdominance 


Underdominance, also known as negative 
heterosis, is a natural phenomenon and the 
opposite of the better-known overdomi- 
nance (positive heterosis or hybrid vigour). 
Thus, underdominance occurs where hy- 
brids are of lower fitness than either of two 
different true-breeding parental strains; for 
practical purposes, one of these parental 
strains is wild-type, the other is the under- 
dominance-based (UD) gene drive strain. In 
a single locus scenario, modelling of such se- 
lection against hybrids has been shown to 
allow for the eventual fixation of one allele, 
with the other being eliminated (Wilson and 
Turelli, 1986; Altrock et al., 2010, 2011). 
More recently, transgenes displaying these 
properties have been developed and tested 
(Reeves et al., 2014; Maselko et al., 2020; 
Buchman et al., 2021). The UD concept can also 
be expanded beyond a single locus. UD gene 
drives can potentially be developed using 
transgenic constructs containing toxin and 
antidote components. The particular config- 
uration considered here is two-locus UD as 
originally proposed by Davis et al. (2001). 
This approach requires two distinct trans- 
genic constructs to be inserted at independ- 
ently segregating (unlinked) genomic loci, 
each of which comprises a lethal effector (toxin) 
and a suppressor (antidote) for the toxin of 
the other transgenic construct (Fig. 12.1). 
One (or optionally both) of these transgenic 
constructs will also contain a genetic cargo 
aimed at producing a desirable phenotype 
within the target population, for example a 
reduced ability to transmit a given patho- 
gen (e.g. Franz et al., 2006, 2014; Buchman 
et al., 2019) (see Franz, Chapter 22, this 
volume). This combination of transgenic 
components gives an underdominance-like 
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Fig. 12.1. A schematic diagram illustrating the workings of a two-locus engineered 
underdominance gene drive system. (Left) This gene drive design requires the introduction of two 
distinct transgenic constructs at independently segregating genomic loci. Each transgenic construct 
comprises a toxin, an element suppressing the toxin of the other transgenic construct and a desirable 
genetic cargo (which may be included within one or both constructs). (Right) A Punnett square 


demonstrating the creation of an underdominance-| 


ike effect. Here haplotypes outside the square 


represent gametes from a viable parent (one maternal and one paternal). The resulting offspring 
genotypes are listed inside the square and it is here that lethality manifests in individuals carrying just 
one of the transgenic constructs (grey crosses), creating a selective pressure for individuals to carry 


either both transgenic constructs or neither. 


effect by creating a negative selection (via a 
lethal effect) on individuals carrying just 
one of the transgenic constructs (since they 
contain a toxin but not the antidote from 
the other transgenic construct). This results 
in a positive selection pressure for individ- 
uals to carry either both or neither of the 
transgenic constructs (Fig. 12.1). The 
precise strength of this selection pressure is 
dependent on several factors, including the 
degree of lethality conferred on affected 
genotypes and the fitness cost caused by the 
presence of the gene drive constructs, for ex- 
ample imperfect suppression of the lethals 
by the suppressors, or insertional effects of 
the transgene on nearby genes. 


12.3 Mathematical Modelling 
Approaches 


Building gene drives in the laboratory is an 
inherently time-consuming and expensive 
activity. Mathematical modelling, on the 
other hand, can be conducted relatively 
quickly and, by comparison, inexpensively 
and allows systematic exploration of param- 
eter space far more readily than empirical 
methods. It is therefore extremely beneficial 
to model any proposed gene drive approach 
in advance of (or at least concurrently with) 
the laboratory development of transgenic 


components, and this has indeed been wide- 
spread practice. Models can be used to deter- 
mine essential performance targets that 
must be met for engineered gene drives to 
achieve their intended function, particularly 
within laboratory-based experiments or 
field-based trial releases. The structure and 
complexity of such gene drive models can 
vary dramatically, with each having their 
own respective benefits and limitations. It 
is here that experienced mathematical 
modellers are key in determining the most 
appropriate model to use in any given scen- 
ario while understanding and being able to 
communicate to non-modellers how a given 
model structure may influence modelling 
outcomes. It seems intuitive that math- 
ematical modelling has the potential to save 
vast amounts of experimental time, effort 
and money where gene drive designs and en- 
gineered components are not fit for purpose. 
Perhaps less obvious is the requirement 
for models, and model-based conclusions, 
to be used (or at least scrutinized) by ex- 
perienced modellers who understand the 
impact of model assumptions, complexity 
and limitations. 

Avariety of mathematical modelling ap- 
proaches have been used to provide insight 
into the predicted performance of UD gene 
drives; however, much of this work has fo- 
cused on deterministic population genetics 
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models. These can provide insight into the 
basic function and utility of UD systems. 
Such models typically consider an idealized 
scenario consisting of an infinitely large 
population (avoiding stochastic effects and 
the need for integer numbers of individuals) 
that is isolated (closed from any migration) 
and panmictic (randomly mating). For 
simplicity, it is also commonly assumed that 
females mate only once and produce a 1:1 
(female to male) sex ratio in their offspring. 
Attention is typically restricted to the case 
whereby no resistance mechanisms can 
emerge and where each component of the 
introduced transgenes is immutable and per- 
fectly linked (i.e., toxin, antidote and cargo 
genes are unable to separate from one an- 
other). Finally, it is commonly assumed that 
generations of offspring in modelled popula- 
tions are synchronous (i.e., non-overlapping), 
which may not always be realistic but can 
apply to laboratory caged populations or 
wild populations that are synchronized by 
climatic factors (e.g., wet and dry seasons). 
This allows for the use of simple recurrence 
relations (i.e., difference equations) to 
model the population genetics resulting 
from the release of such a gene drive. This 
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typical set of simplifying assumptions is also 
adopted in the example below. 

Much of the modelling of UD gene 
drives uses genotype-based population gen- 
etics models. In the case of a two-locus ap- 
proach such as UD, this results in a total of 
nine possible genotypes - homozygous, het- 
erozygous or wild-type for the transgene at 
each of two loci — and therefore a set of nine 
difference equations. However, since UD is 
based entirely on Mendelian inheritance 
and lethality to certain offspring genotypes, 
the offspring in each generation are directly 
related to the proportions of each allele pre- 
sent in the parental generation rather than 
the precise parental genotypes. This allows 
the consideration of a simpler model that re- 
quires the tracking of only the four haplotype 
frequencies (ab, Ab, aB and AB, where a/b 
represent wild-type alleles and A/B their trans- 
genic counterparts). This results in a set of 
four difference equations that may be solved 
recursively in a manner similar to that origin- 
ally presented by Davis et al. (2001). Magori 
and Gould (2006) considered a similar model 
structure but additionally allowed for multiple 
insertions of each transgene, though this is 
not included here. This model is of the form: 


ab, = — = (Equation 1) 
Q Q 
(Ze%ab.an,)+( Foran. ab,}+ (2°28, 8) 
_L\2 2 h (Equation 2) 
aB, > z? 
Q Q 
| Ze%ap,aB, (2%, ab, + (6°46, 8.) 
Ab... = 2 h (Equation 3) 
t+1 =~ = 
Q Q 
[5e°ab,aB, )+{50°aB,Ab, )+(e°aB,AB,) +(6°Ab,AB,)+(e*AB.) 
2 2 f 
AB, = Q = Q’ 
(Equation 4) 
where 


O=f, +h th th, (5) 
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is the sum of numerators in (Equation 1)- 
(Equation 4), e denotes the fitness (relative to 
wild-type) of an individual carrying a trans- 
genic construct and t denotes the generation 
from which the next allele frequency is com- 
puted. For simplicity, here we assume that 
each transgenic construct (A and B) confers 
the same degree of fitness cost on carrier indi- 
viduals and that these are applied multiplica- 
tively where individuals carry more than one 
transgenic construct (up to a maximum of 
four, where the relative fitness would be given 
by e*). Note that the parameter e can take any 
value in the range from zero (completely 
non-viable) to one (equally as fit as wild-type) 
and that the consideration of multiplicative 
relative fitness ensures that the overall value 
for any genotype also remains in the range 
from zero to one. Like much of the modelling 
literature, here we assume that toxins are 
fully penetrant (i.e., no viable offspring result) 
and similar for antidotes (i.e., a single anti- 
dote copy provides full rescue). 

The model presented here provides one 
of the simplest possible models of UD gene 
drive and is useful for determining various 
base-level characteristics of the system. As 
with all models, this is based on a range of 
simplifying assumptions (described above), 
each of which is likely to have its own impli- 
cations. There is a wide range of other mod- 
elling approaches that can be (and have 
been) used to capture the anticipated effects 
of relaxing one or more of these model as- 
sumptions. Several of these are briefly dis- 
cussed in the following sections, focusing 
primarily on results rather than extensive 
modelling detail. Modelling of UD systems 
has spanned a range of model structures, 
including difference equations, ordinary 
differential equations (ODEs), delay differ- 
ential equations (DDEs), partial differential 
equations (PDEs) and stochastic models, 
each of which provides insights into differ- 
ent aspects of anticipated UD behaviour. 


12.4 Introduction Thresholds 


Underdominance acting at a single locus has 
been shown to produce bistable dynamics: 
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either homozygotic state can be stable, de- 
pending on the initial frequencies and 
relative fitness of each type, as shown for 
underdominant alleles (Wilson and Turelli, 
1986; Altrock et al., 2010, 2011) and chromo- 
some translocations (Curtis, 1968). UD gene 
drives seek to capture a similar effect syn- 
thetically, via the introduction of toxin and 
antidote elements. While the threshold 
dependence of UD has been demonstrated 
using numerical simulation under a range of 
release sizes, to our knowledge a full equilib- 
rium analysis has yet to be conducted. This 
can be achieved either analytically or com- 
putationally and here we focus on the latter, 
using the numerical continuation software 
package XPPAUT (Ermentrout, 2002), pro- 
ducing results shown in Fig. 12.2. 

These results show two distinct regimes 
of behaviour separated by a particular rela- 
tive fitness parameter e* ~ 0.725 (as these 
are applied multiplicatively, this gives UD 
double homozygotes a relative fitness of just 
~0.276). In the region e < e* there are two 
possible equilibrium states, with either the 
wild-type (stable) or gene drive (unstable) 
alleles at fixation. This can be interpreted as 
a scenario in which the gene drive is unable 
to establish itself, no matter how many gene 
drive-carrying individuals are introduced. 
The unstable equilibrium whereby gene 
drive alleles are at fixation is not biologically 
feasible, since it would imply there were no 
wild-type individuals present at the time of 
release — rendering the release of a gene 
drive unnecessary. The more interesting re- 
gion (e > e*) displays four possible equilib- 
rium states. The first is the unstable (and 
not biologically feasible) equilibrium state 
with gene drive alleles at fixation. The three 
remaining equilibria together constitute a 
bistable system (i.e. two stable equilibria 
separated by an unstable equilibrium). Fo- 
cusing on the gene drive allele (Fig. 12.2c) 
below the unstable equilibrium, the elimin- 
ation of the gene drive is the only stable 
equilibrium - representing negative selec- 
tion against the gene drive when introduced 
at a sub-threshold frequency. Above the 
unstable equilibrium, the gene drive moves 
towards a stable equilibrium with high 
gene drive frequency — representing positive 
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Fig. 12.2. A bifurcation diagram showing the possible equilibria of a two-locus engineered 
underdominance gene drive and their associated stability properties. Here stable equilibria are 
shown by solid black lines while unstable equilibria are shown by red dashed lines, with panel (a) 
showing ab (i.e. fully wild-type), (b) Ab or aB (i.e. a single UD allele) and (c) AB (i.e. both UD alleles) 
haplotype frequencies. These diagrams show the bistable nature of this gene drive approach. When 
introduced above a certain threshold, the gene drive system increases in frequency, towards the stable 
equilibrium with non-zero AB haplotype frequency. When introduced below this threshold, the system 
decreases in frequency, heading towards the stable equilibrium with a zero AB haplotype frequency. 
These diagrams also demonstrate the existence of a maximum tolerable fitness cost for two-locus 
engineered underdominance gene drive systems of around 28% per construct. Bifurcation analysis was 
conducted using XPPAUT continuation software (Ermentrout, 2002) and results were plotted using 
MATLAB (R2020b, The MathWorks Inc., Natick, Massachusetts). 


selection when introducing the gene drive 
above the threshold frequency. Theoretically 
it is possible that the system would attain 
the unstable equilibrium state and remain 
there; however practically this is exceedingly 
unlikely, due to the many and varied sto- 
chastic effects present in the real world. 

A feature evident in the results of Fig. 
12.2 is that the equilibrium state for a UD 
system does not necessarily comprise only 
gene drive homozygotes. Where there are no 
fitness costs associated with the gene drive, 
the system can reach fixation (i.e. 100% 
gene drive homozygotes). However, where 
fitness costs are non-zero, wild-type alleles 
are expected to be present at a frequency 
that increases with the fitness costs of the 
system (Fig. 12.2b). 

While useful in displaying the bistable 
(i.e. threshold-dependent) nature of a UD 
gene drive, Fig. 12.2 is not necessarily of dir- 
ect use when planning a gene drive release. 
This is due to the combination of AB and 
Ab/aB haplotypes present at the unstable 
equilibrium (i.e. the introduction thresh- 
old). In practice it would be more convenient 


to know a single gene drive frequency above 
which the system must be introduced for it 
to increase in frequency within the target 
population. Fortunately, this can be calcu- 
lated by summing to obtain the overall gene 
drive allele frequency for each point on the 
unstable equilibrium line. This results in a 
single threshold gene drive allele frequency 
(as shown in Fig. 12.3) that must be ex- 
ceeded through any combination of hetero- 
zygote or homozygote individuals. Note that 
these results align with the pattern observed 
in several previous studies (e.g. Magori and 
Gould, 2006; Edgington and Alphey, 2017, 
2018; Dhole et al., 2018, 2020; Leftwich 
et al., 2018), though precise thresholds may 
differ due to assumptions on the application 
of fitness costs to individuals carrying mul- 
tiple transgenic constructs and the choice of 
presentation method. 


12.5 Relaxing Model Assumptions 


As discussed above, gene drive models are 
based on a range of simplifying assumptions, 
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Fig. 12.3. Threshold introduction frequencies required for an engineered underdominance system 
to spread in a target population over a range of relative fitness parameters. Note that this figure was 
generated by summing gene drive allele frequencies from results in Fig. 12.2, but mirror those previously 
shown in Edgington and Alphey (2017, 2018), using alternative mathematical models — although some 
slight differences are observed due to assumptions on how fitness costs should be applied to individuals 


carrying multiple transgenic constructs. 


each of which has its own implications for 
the outcomes and applicability of models to 
different scenarios. The model outlined in 
section 12.3 represents one of the simplest 
useful representations of a UD gene drive 
system and, as shown in section 12.4, allows 
key characteristics of this gene drive design 
to be elucidated. Of course, this model struc- 
ture can be altered to allow for the relaxation 
of any of the model assumptions outlined 
above, thereby allowing their implications to 
be explored. In the following sections we 
discuss studies exploring the relaxation of 
three such model assumptions: (1) the pres- 
ence of resistance formation and mutation 
of transgenic constructs; (2) the reversal of 
UD gene drives; and (3) the incorporation of 
spatial effects. 


12.5.1 Resistance formation 


and mutation 


A common set of simplifying assumptions 
for gene drive modelling studies is that 


elements within a single transgenic con- 
struct are perfectly linked (i.e., unable to 
separate), do not undergo mutation (i.e., 
lose function of transgenic components) 
and that no other resistance mechanisms 
emerge. Edgington and Alphey (2019) 
relaxed this assumption by modelling a scen- 
ario whereby transgenic constructs accumu- 
late loss-of-function mutations at a constant 
rate. To our knowledge, rates of mutation in 
insects likely to be targeted by gene drives 
are not well studied and could vary consider- 
ably, depending on the molecular biology of 
the gene drive system. Thus, mutation rates 
(per gene) are assumed to fall within the 
range 10-10 that should span rates rele- 
vant to a range of target insect species. This 
parameter range is based on measured 
mutation rates in Drosophila melanogaster 
(estimated as being of the order 10° per nu- 
cleotide per generation) (Haag-Liautard 
et al., 2007; Keightley et al., 2014); the size 
of gene drive constructs in previous studies 
(~1-10 kb) (Windbichler et al., 2011; Reeves 
et al., 2014; Champer et al., 2017) and an 
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Fig. 12.4. Transgenic constructs are assumed to mutate at a rate of m per gene. This is assumed to 
be low enough that multiple mutations per generation may be neglected. For example, the initial transgenic 
construct (say, A) mutates at a rate of 3m, producing mutations in the lethal (giving A,), suppressor (A,) 
and the cargo (A,) gene each at a rate of m. Then, transgenic constructs possessing one mutated gene 
(e.g. A,) mutate at a rate of 2m (giving A, and A, each at rate m). Transgenic constructs with two mutated 
genes (e.g. A ¿) then mutate at rate m, producing constructs with all three genes mutated (i.e. A, ,,). Here, 
non-mutated genes are represented by black squares whereas genes with loss-of-function mutations are 
shown in red circles. (Figure originally published in Edgington and Alphey, 2019.) 


assumption that ~1-10% of nucleotides in 
transgenic constructs are essential for gene 
function. Note that these mutation rates are 
assumed to be low enough that multiple muta- 
tions (i.e., in more than one gene) within a 
single generation may be neglected, leading 
to the pattern of mutation shown in Fig. 12.4. 

The original study describing this scheme 
of mutation (Edgington and Alphey, 2019) 
considered a genotype-based formulation, 
resulting in a set of 2025 genotypes, of 
which 819 were non-viable. This could be re- 
duced to a haplotype-based formulation 
with 81 haplotypes (81 difference equa- 
tions), making the model simpler and faster 
to formulate, code and simulate. 

This study found that UD displays an 
increase in frequency that is almost com- 
pletely unaffected by such mutation where 
mutated transgenic constructs conferred a 
greater fitness cost than their non-mutated 
counterparts — such cases will not be dis- 
cussed further. Loss-of-function mutations 
are therefore only of concern where mutated 
transgenic constructs are of higher fitness 
than non-mutated versions. Here, the intro- 
duced UD system would initially increase in 
frequency if introduced above the required 
threshold. Over time each type of mutated 
transgenic construct will begin to accumu- 
late, with the rate of accumulation and 
maximum frequencies varying depending 
on which loss-of-function mutations are 
present. Results in Edgington and Alphey 
(2019) show that, for a range of mutation 
rates and fitness costs, it is transgenic constructs 
with a single loss-of-function mutation in 


either the lethal or cargo gene that achieve 
the greatest maximal frequencies (Fig. 12.5). 
Constructs with loss-of-function in two 
genes achieve lower frequencies and are 
dominated by those where the antidote 
(suppressor) gene is unaffected. Combined, 
the mutated transgenic constructs reach 
high overall frequencies, with a concurrent 
decrease in the frequency of non-mutated 
constructs. Since UD approaches typically 
achieve an equilibrium in which wild-type 
alleles remain present (see Fig. 12.2), these 
begin to replace the mutated transgenes due 
to their relative fitness advantage, eventu- 
ally returning the population to a fully wild- 
type state (Fig. 12.5). It is yet to be studied 
in depth whether the stable co-existence of 
mutated and non-mutated transgenic con- 
structs is possible, but it was not observed 
under any parameter set or initial condition 
considered in Edgington and Alphey (2019). 

For different fitness costs and mutation 
rates, the observed dynamics remained 
broadly similar to those in Fig. 12.5, though 
the timescales and maximal frequencies of 
each mutation vary. Higher mutation rates 
reduce the period over which the UD system 
persists at high frequency - essentially the 
period in which the gene drive would main- 
tain efficacy. Even though UD systems can 
be eliminated by mutations, they are pre- 
dicted to remain at high frequency for hun- 
dreds of generations - likely long enough for 
the system to have produced its desired ef- 
fect. As an example, Aedes aegypti mosquitoes 
(vectors of dengue, Zika, yellow fever and 
chikungunya viruses) undergo approximately 
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Fig. 12.5. An example numerical simulation showing the effects of mutation within transgenic 
constructs of an engineered underdominance gene drive system. Results here are presented for a 
1:1 (introduced to wild) introduction of non-mutated double homozygote (AABB) individuals into a 
wild-type (aabb) population. Note that several alleles only reach very low maximum frequencies and thus 
they appear to overlie one another along the horizontal axis. Results are shown for an engineered 
underdominance system with 5% fitness cost per non-mutated construct, a 4% fitness cost per mutated 


construct and a mutation rate of m = 10. 


ten generations per year, meaning that the 
introduced UD system should persist at high 
frequency for at least ten years, even in the 
face of accumulating loss-of-function trans- 
gene mutations. 


12.5.2 UD reversal 


Ideally, a gene drive, or any other interven- 
tion, should be simple to reverse in the event 
of any unintended and undesirable conse- 
quences. For threshold-dependent drives 
such as underdominance (UD), the simplest 
mechanism for this is the release of wild- 
type individuals. Releasing a sufficiently 
large number of wild-type individuals can 
push the UD system below the threshold fre- 
quency, thus it undergoes negative selection 
and is driven out of the population (consider, 
for example, Fig. 12.2). To our knowledge, 
only one alternative reversal mechanism has 
been proposed for reversing UD, namely re- 
leasing individuals carrying free suppressors 


(i.e., individuals carrying just the antidote 
from one or both of the original transgenic 
constructs) (Edgington and Alphey, 2019). 
This was proposed since modelling of muta- 
tion in UD systems showed significantly 
greater accumulation of mutated constructs 
that retained function of the antidote gene, 
which undergo positive selection where a 
UD system is at high frequency. The same 
study also showed that if free-suppressor 
elements conferred a non-zero fitness cost, 
then the positive selection would be lost as 
the UD system became rare in the popula- 
tion, thus allowing wild-type alleles to 
outcompete the free suppressor element, 
returning to a fully wild-type state. 

With there being two viable mechan- 
isms for UD reversal, it is instructive to com- 
pare the two. Each has benefits, but which 
would be more useful in real-world scen- 
arios? If a released UD system were to require 
reversal, then it would be reasonable to as- 
sume that introducing large numbers of 
wild-type individuals would be undesirable, 
as it could potentially increase the population 
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above pre-control levels, albeit transiently. 
Intensive suppression might reduce num- 
bers in the entire population such that the 
necessary wild-type releases remain below 
pre-control levels, but that would have its 
own costs and issues. In some cases, issues 
around release of wild-type individuals could 
be alleviated somewhat by releasing individ- 
uals of a single sex (as discussed in Leftwich 
et al., 2018). For example, in many insect 
species (and certainly many of those likely to 
be gene drive targets) females provide both 
the reproductive and epidemiological poten- 
tial of the population; thus, releasing wild- 
type males should generally be quite benign. 

The release of free-suppressor individuals 
provides an alternative reversal mechanism 
and addresses some of the issues associated 
with wild-type release(s) but also has some 
drawbacks of its own. For example, free- 
suppressor individuals are theoretically able to 
function from an extremely small release. In 
practice, it would be desirable to perform re- 
lease(s) large enough to avoid stochastic loss at 
low frequencies. Despite this, it should still be 
feasible to release far fewer individuals than 
required for wild-type reversal and these may 
be of a single sex. However, there are also some 
potential drawbacks that need to be weighed 
against these benefits. Firstly, free-suppressor 
releases have been shown to function much 
more slowly than wild-type reversal. For in- 
stance, Edgington and Alphey (2019) show 
that a 2:1 (reversal to wild) release of wild-type 
individuals can (approximately) eliminate 
the UD gene drive in about 20 generations 
whereas an equal release of free suppressor 
individuals took about 130 generations to 
reach the same point, with about a further 
150 generations required for the free- 
suppressor individuals to be (approximately) 
eliminated. Another potential issue is whether 
the appropriate regulatory body would 
approve the release of further transgenic in- 
sects if the original system required reversal 
due to unintended negative effects. 


12.5.3 Spatial effects 


A key feature of UD gene drives is their 
threshold-dependent nature, since this is 
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often stated to be capable of preventing the 
system from establishing in non-target 
neighbouring populations. It may even 
prevent the system reaching an appreciable 
frequency, due to negative selection when 
present at sub-threshold frequencies. The 
modelling of such spatial factors is therefore 
important in assessing how well confined 
UD systems will remain and under what con- 
ditions this confinement could potentially 
fail. These questions are expected to prove 
important when seeking to attain regulatory 
approval for UD releases into the field; with 
highly robust confinement, regulatory approval 
far beyond the release site(s), for example 
regional or multi-national approval, may not 
be required, in contrast to current thought 
regarding more invasive approaches. 

Spatial effects can conceivably be stud- 
ied in a variety of ways, including n-deme 
population genetics (difference equations), 
n-deme population dynamics (ordinary or 
delay differential equations), reaction- 
diffusion (partial differential equations) or 
individual-based models. Each of these has 
been used in the study of gene drive ap- 
proaches, although not all in the context of 
UD, and possesses its own positive and nega- 
tive features. Here we discuss a range of 
these approaches in the context of UD gene 
drives, focusing primarily on findings rather 
than technical details. While we cover a 
broad range of studies here, this is by no 
means intended as a comprehensive review. 

Perhaps the most commonly used tech- 
nique for assessing spatial properties of 
gene drives are n-deme population genetics 
models. Briefly, these consider two or more 
demes (semi-isolated (sub)populations), 
each of which has its own set of difference 
equations of the form outlined in section 
12.4. For simplicity, the literature primarily 
considers a scenario with just two demes. 
These models capture the migration of indi- 
viduals between demes via a simple ex- 
change of a proportion of individuals in each 
generation. It is commonly assumed that 
the two populations are of equal size, such 
that the number of each migrant type is sim- 
ply based on the haplotype frequencies in 
each population - a reasonable assumption 
where large (modelled as infinite) populations 
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are considered. This approach has been used 
to consider spatial aspects for a wide range 
of gene drive classes, including UD (Mar- 
shall and Hay, 2012; Harvey-Samuel et al., 
2019; Edgington et al., 2020b). One such 
model estimated that a UD system with a 
homozygote fitness cost of 5% (applied ad- 
ditively) and a bidirectional migration rate 
of 1% per generation would reach near-fixa- 
tion in the target population (e.g. results in 
Fig. 12.2) but reaches a frequency of just 
0.032 in a non-target neighbouring popula- 
tion (Marshall and Hay, 2012). The same 
work then went on to estimate that the same 
UD system would require a bidirectional mi- 
gration rate of 4.3% per generation to be- 
come established in both populations, thus 
supporting the notion that UD is robustly 
confineable. 

The approximation of equal population 
sizes has also been relaxed in a number of 
studies focusing both on UD (Dhole et al., 
2018) and on other gene drive classes (Dhole 
et al., 2019, 2020). These studies include an 
approximate scaling of migration rates to 
account for the differences in respective 
population sizes that result from gene drive 
fitness cost and lethal effects. While this cap- 
tures an additional level of realism absent in 
the non-scaled migration models, the results 
obtained do not suggest that this will have a 
large impact on the ability of UD systems to 
remain confined. In fact, we would suggest 
that this is likely to improve the confine- 
ment, since transgene fitness costs and le- 
thal effects will reduce the target population 
size, meaning fewer migrants into the 
non-target population. However, this will 
likely increase the influence of wild-type 
migrants from the non-target population, 
potentially creating a necessity for slightly 
larger UD release(s) to ensure the system re- 
mains above the introduction threshold. 

Several studies have also considered 
extensions to these population genetics 
models by taking account of various eco- 
logical factors affecting the life cycle and size 
of the target and non-target insect popula- 
tions when subject to the release of a UD 
gene drive (e.g., Edgington and Alphey, 
2018; Khamis et al., 2018, 2020). These each 
consider their own model structures to 
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capture density-dependent effects during 
the immature stages of the insect life cycle, 
each with its respective advantages and dis- 
advantages. One density-dependence func- 
tion used in such models is that of Maynard 
Smith and Slatkin (1973) and is of the form: 


f(N)=(1+(aN)’) 


where N denotes the size of the insect 
population, a is a density parameter (1/a re- 
lates to the number of breeding sites) and b 
defines the strength of density-dependent 
competition. This function is known to be 
flexible in that it can capture a range of den- 
sity-dependence scenarios (Bellows, 1981) 
and has been used in the study of UD (Edg- 
ington and Alphey, 2018) and other gene 
drive classes (Alphey and Bonsall, 2014). 
This work outlines a variety of possibilities 
not extensively discussed in the results of 
population genetics models. In particular it 
identifies three possible outcomes of a UD 
release: (1) no introgression in either popu- 
lation; (2) establishment in both popula- 
tions; and (3) introgression into the target 
population with extremely limited spread 
into the non-target population, with the lat- 
ter usually considered the most desirable 
outcome for UD (Edgington and Alphey, 
2018). Sanchez et al. (2020a) considered 
similar effects for UD™" and reciprocal 
chromosome translocations using a com- 
putational framework called MGDriveE 
(Sanchez et al., 2020b; Wu et al., 2021). 

The above approaches consider spatial 
effects via an exchange of individuals 
between two (sub)populations, which are 
assumed to be well mixed. A possible exten- 
sion to this work is to consider spatial effects 
explicitly, using a model defined over a con- 
tinuous spatial domain (e.g., Champer et al., 
2020d). This work takes a fully computa- 
tional approach, using an individual-based 
model implemented in the open-source 
software package SLiM (Haller and Messer, 
2016, 2019). Here, two circular regions (sub- 
populations) are linked by a narrow migra- 
tion corridor, with movement assumed to re- 
sult from the birth of new offspring (Champer 
et al., 2020d). This showed that UD is robust 
against re-invasion by wild-type but may 
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display a greater degree of invasiveness into 
neighbouring populations than predicted 
with the spatially implicit model structures 
discussed above. However, the narrow mi- 
gration corridor essentially forces migrating 
individuals to encounter those moving in 
the opposite direction, creating an approxi- 
mately linear boundary between the two 
populations — a scenario shown to facilitate 
easier gene drive invasion (Champer et al., 
2020d). 

An alternative scenario would allow mi- 
gration to occur over a wider space, meaning 
migrants encounter those moving in the op- 
posite direction far less frequently, thus 
eliminating the linear boundary within the 
migration corridor. Here migrants would 
first encounter individuals when arriving at 
the boundary of the opposite population, 
meaning they would encounter either a very 
high or very low local gene drive frequency, 
likely producing results closer to those from 
spatially implicit model structures. Such 
variation highlights the importance of under- 
standing a wide range of species, location 
and ecological traits when predicting the 
outcome of a real-world UD release. 

While the above approaches assess the 
likelihood of UD invading non-target popu- 
lations, spatial effects are also important in 
determining the ability of UD to spreadina 
given target population. This has also been 
addressed using a variety of different model- 
ling approaches. 

One possibility is to consider a lattice- 
based model in which the target region is 
discretized into a collection of cells, each 
containing a well-mixed pool of individuals 
(Huang et al., 2011). Individuals move be- 
tween cells according to a dispersal kernel, 
defining the probability of an individual 
moving between any two cells in the lattice 
on any given day. This model structure was 
used to compare the relative efficacy of two 
UD release methods: (i) release into one 
large area; and (ii) release into many small- 
er, equally distributed areas. Interestingly, 
this work showed that either release 
method could be more effective, depending 
on the degree of mobility exhibited by indi- 
viduals and the fitness costs associated 
with the UD system. 
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The individual-based model of Champer 
et al. (2020d) has also been used to explore 
the ability of UD to spread within a single 
population. This considered two spatial 
scenarios based on the shape of the UD re- 
lease area, namely a scenario with either a 
straight-line (linear) scenario or a circle div- 
iding regions of high and low/zero gene 
drive frequency. Interestingly, the UD sys- 
tem was able to spread or persist more read- 
ily in the linear scenario than the circular 
one. This was proposed to be a result of the 
local gene drive frequency being lower for 
the circular case, since the wild-type par- 
tially wraps around the high UD region. 
However, we would expect this effect to 
diminish rapidly as the circular region in- 
creases in size (reducing the curvature of 
the boundary). 

Finally, another potential approach for 
considering spatial effects in either single or 
linked populations is the use of reaction- 
diffusion equations (i.e., partial differential 
equation models). These have been explored 
in the context of highly invasive CRISPR- 
based gene drives (Beaghton et al., 2016; 
Tanaka et al., 2017), but to our knowledge 
have yet to be widely applied to thresh- 
old-dependent systems. Such models should 
enable a wide range of spatial scenarios to be 
considered, while allowing various sources 
of heterogeneity to be considered across the 
spatial domain. These models can also allow 
an explicit representation of insect migra- 
tion to be incorporated into population dy- 
namics model structures, such as those in 
Edgington and Alphey (2018) and Khamis 
et al. (2018, 2020). 


12.6 Linking Theory and 
Experimentation 


The development of gene drive technolo- 
gies, including UD, has generally followed a 
design—build-test cycle (Fig. 12.6). At each 
stage, modelling can play an important role 
in designing, understanding and analysing 
experimental work. Thus, in addition to 
providing insights, modelling can save a sig- 
nificant amount of research time, effort and 
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Fig. 12.6. A cartoon showing the design—build-test 
cycle commonly followed in the development of 
gene drive technologies and that mathematical 
modelling is an important tool within each phase. 


money. To date there has been relatively lit- 
tle published experimental work on UD and 
so this section focuses on how one could link 
theory and experiment when experimental 
data becomes available. This will partially be 
informed by studies on alternative gene 
drive classes and will describe each phase of 
the design-build-test cycle; however, since 
the ‘design-build’ aspect of this cycle has 
largely been covered in previous sections, it 
will not be discussed further here. 

In the ‘build—test’ phase, experimenta- 
tion commonly focuses on discrete gener- 
ation laboratory cage-based experiments. 
Models of the form shown in section 12.4 
assume discrete generations and so they are 
ideal for predicting and analysing experi- 
ments of that form. In the first instance, 
specific test crosses between transgenic in- 
sect strains can be performed and the num- 
ber of offspring of each resulting genotype 
counted/screened (as in Hammond et al., 
2016, 2017, 2020; Kyrou et al., 2018; Adolfi 
et al., 2020; Champer et al., 2020b,c; Simoni 
et al., 2020), with fluorescent markers com- 
monly used to distinguish between types. 
This allows a first approximation of various 
system parameters (for example, relative fit- 
ness, toxin penetrance and strength of the 
antidote effect (Buchman et al., 2018b; Web- 
ster et al., 2020)) to be generated by calculat- 
ing ratios between the mean number of each 
genotype produced. These can then be used 
to parameterize models and predict the ex- 
pected outcomes of gene drive cage trials. 
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Note that other gene drive classes may ap- 
proximate fitness costs by fitting models to 
data from cage trials with only a single 
transgenic construct present (e.g. Webster 
et al., 2020), but this is not feasible for UD 
where a single transgenic construct pro- 
duces a lethal phenotype. 

Following the ‘test’ phase, i.e., after 
laboratory cage trial experiments, prior 
predictions of gene drive behaviour can be 
compared with the actual cage trial data. 
Using the same mathematical models (and 
potentially stochastic versions also, espe- 
cially given the relatively small numbers of 
individual mosquitoes typical of a labora- 
tory cage experiment), one can then assess 
whether model predictions are consistent 
with the observed outcomes (as in Ham- 
mond et al., 2016, 2020; Buchman et al., 
2018b; Kyrou et al., 2018; Pham et al., 2019; 
Adolfi et al., 2020; Champer et al., 2020c; 
Simoni et al., 2020; Webster et al., 2020). If 
model and experimental results are consist- 
ent, then parameters can be varied to iden- 
tify potential areas for improvement in the 
gene drive design. Conversely, if model and 
experimentation are not consistent, then 
further modelling may be required to iden- 
tify sources of this mismatch, potentially 
informing future models and experimental 
designs (as in Hammond et al., 2017, 2020, 
for CRISPR-based gene drives). 


12.7 Alternative Configurations of UD 


Previous sections focused on a specific con- 
figuration of UD, namely that based on two 
mutually repressing bi-sex toxin genes, in- 
serted into two unlinked and independently 
segregating genomic loci. Similarly, we have 
primarily focused on the release of both 
transgenic males and females. However, 
there is a wide range of alternative methods 
for engineering/releasing UD systems, a var- 
iety of which have been studied previously. 
For the UD design considered thus far, a 
potential variation is in the time at which 
the toxin takes effect. This has been studied 
for a UD system with toxins and transgene 
fitness costs acting either in early (before 
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density-dependent competition, for example 
eggs or early instar larvae for mosquitoes) or 
late (following density-dependent competi- 
tion, for example pupae or pharate adults in 
mosquitoes) developmental stages (Edging- 
ton and Alphey, 2018; Khamis et al., 2018). 
While early- or late-acting lethal/fitness ef- 
fects do not have any impact on threshold 
introduction frequencies, they can have 
more impact on other traits. For instance, 
Khamis et al. (2018) found that, for a system 
spreading a cargo gene conferring refractori- 
ness to a pathogen, late-acting lethality pro- 
duced a slightly larger reduction in disease 
burden. This is likely due to the greater re- 
duction in both equilibrium and (transiently 
attained) minimum population sizes ob- 
served with late-acting lethality (as seen in 
Edgington and Alphey, 2018). Despite this 
potential epidemiological benefit, a greater 
reduction in population size may not always 
be good news. For example, Ae. aegypti mos- 
quitoes are known to compete with Aedes 
albopictus (Edgerly et al., 1993; Juliano et al., 
2002; Armistead et al., 2008). Therefore, an 
Ae. aegypti population may be displaced 
during the period in which the population is 
reduced by a late-acting UD - potentially re- 
ducing the epidemiological benefit as Ae. al- 
bopictus are also competent vectors of a 
similar set of pathogens, including dengue 
viruses (WHO, 2011). This necessitates some 
knowledge of ecological factors in the vicin- 
ity of gene drive target areas, and could be 
addressed by modelling approaches similar 
to those used for sterile insect technique 
(SIT) and release of insects carrying a dom- 
inant lethal (RIDL)-based control (Bonsall 
et al., 2010). 

Other possible UD configurations re- 
volve around the use of sex-specific toxins 
or insect releases, rather than the bi-sex 
versions considered above. Such consider- 
ations have been studied in terms of their 
effect on release thresholds and degrees of 
tolerable transgene fitness costs (Edgington 
and Alphey, 2017). These results showed 
that considering either male-only release(s) 
of gene drive-carrying individuals or female- 
specific toxins results in a lesser ability to 
tolerate fitness costs and higher introduc- 
tion thresholds. 
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In a two-locus UD configuration, it is 
possible that the suppressor element from 
one transgenic construct is not sufficient to 
inactivate two copies of the toxin gene from 
the other transgenic construct. In the con- 
text of UD, this has been referred to as ‘weak 
suppression’ (Edgington and Alphey, 2017, 
2018); however, the mathematical models 
and predicted dynamics are equally ap- 
plicable to systems based on reciprocal 
chromosome translocations (Buchman etal., 
2018b). These studies showed that recipro- 
cal chromosome translocations (or weakly 
suppressed UD) generally have a higher 
introduction threshold than the UD systems 
discussed here. As discussed previously, this 
represents a trade-off between the increased 
cost/difficulty of gene drive introgression 
and the increased reliability of gene drive 
confinement to the target population. 

Several gene drive concepts are based 
on toxin-antidote systems. The UD system 
considered thus far assumes two mutually 
suppressing lethals, each of which comprise 
a ‘toxin’ gene and an antidote that sup- 
presses its effect, perhaps RNAi targeting 
the toxin gene. Other toxin-antidote con- 
cepts can also provide threshold-dependent 
gene drives. For example, a synthetic Medea 
drive was constructed in Drosophila using a 
maternally contributed (RNAi) toxin with 
zygotic expression of the antidote only in 
those offspring inheriting the Medea ele- 
ment (Buchman et al., 2018a). Medea is a 
low-threshold drive, zero-threshold in the 
absence of fitness costs, but a mutually re- 
pressing pair of such elements can provide a 
threshold-dependent drive known as UDM®" 
(Akbari et al., 2013) or double Medea (Wim- 
mer, 2013). Fora rather different molecular 
basis, a CRISPR/Cas9 system (toxin) can be 
used to disrupt an essential endogenous 
gene, which can then be rescued with a 
recoded - and therefore toxin-resistant — 
antidote. A range of threshold-dependent 
gene drives using this technology have pre- 
viously been modelled (Champer et al., 2021) 
and are based on the use of cleave and rescue 
(Oberhofer et al., 2019) or CRISPR toxin- 
antidote (Champer et al., 2021; Champer 
et al., 2020a; Champer et al., 2020b) elem- 
ents. These have been discussed extensively 
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in the original sources and produce broadly 
similar behaviour to the approach(es) dis- 
cussed here. Additionally, the mathematical 
modelling frameworks considered in the stud- 
ies listed above are similar to those explored 
throughout this chapter and so we do not 
discuss results of these studies any further. 


12.8 Areas of Future Interest 


Despite all the modelling work discussed 
above, there remain several areas in which 
further modelling could elucidate various 
characteristics of UD gene drives. Some have 
briefly been mentioned in the relevant sec- 
tions above and so we focus predominantly 
on areas not yet discussed. 

Above, we discussed the use of labora- 
tory cage trial experiments for inferring 
parameters of the UD system. While these 
are useful for predicting system perform- 
ance, these estimates are inherently flawed 
when moving into the field since they as- 
sume that laboratory wild-type strains - and 
environments — are a good approximation of 
insects in the wild. In practice, laboratory 
wild-type strains are recognized as having 
lower fitness than their wild counterparts 
(Leftwich et al., 2021), likely due to many 
generations of laboratory adaptation (Left- 
wich et al., 2016; Ross et al., 2019). Models 
can capture the impact of this to a certain 
degree by considering variation of relative 
fitness parameters about laboratory-derived 
estimates. However, the transition toward 
field-based experiments will potentially ne- 
cessitate more detailed models capturing a 
range of ecological, behavioural and fitness 
effects (some of which have been discussed 
above). This enhanced modelling can then 
help to inform the design of UD releases as 
they progress from small-scale field cage 
trials right up to the eventual release in full 
large-scale control programmes. 

A feature of most of the modelling dis- 
cussed here is that it is deterministic and so 
does not account for the stochasticity inher- 
ent in the real world. In the context of UD, 
this will be important when the release of 
transgenic insects results in a gene drive 
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frequency close to (or even below) the intro- 
duction threshold calculated from determin- 
istic mathematical models (i.e., the unstable 
equilibrium discussed above). Here stochas- 
tic models can provide insight into the ex- 
pected likelihood of success or failure (i.e., 
the probability that a UD system increases 
or decreases in frequency) of a given release 
strategy. To date, stochastic modelling of 
UD systems has been limited, to our know- 
ledge, to only Marshall and Hay (2012) for 
this UD configuration. However, some sto- 
chastic modelling frameworks have been 
used to study other gene drive classes, from 
which such work could take a lead (for ex- 
ample: Magori et al., 2009; Champer et al., 
2020a; Edgington et al., 2020a,b; Sanchez 
et al., 2020a; Wu et al., 2021). 

A common feature in the modelling of 
many gene drive classes, and, in particular, 
toxin—antidote-based approaches, is an as- 
sumption that toxins and antidotes are fully 
penetrant (i.e., that toxins kill 100% of tar- 
get genotypes and antidotes rescue 100% of 
carriers to full fitness). However, gene drive 
components engineered in the laboratory 
may not give this degree of efficacy. Labora- 
tory experiments, for example life history 
analysis of different genotypes, can provide 
initial estimates of such incomplete pene- 
trance. These data could be incorporated 
into models similar to those in section 12.4. 
In the absence of working gene drive compo- 
nents to test in the laboratory, one can use 
the same model to explore the expected be- 
haviour for a range of toxin and antidote 
penetrance parameters, thus setting per- 
formance targets for laboratory-engineered 
gene drive components. This could be used 
to assess performance metrics including 
threshold frequencies, the speed of spread, 
the system invasiveness (with n-deme ver- 
sions of the models) and tolerable fitness 
costs — all of which are likely to be vital when 
transitioning from laboratory to field-based 
testing. 

The motivation for genetic control of 
mosquitoes is to reduce or prevent morbid- 
ity and mortality from mosquito-borne dis- 
eases. Thus, it is important to explore the 
anticipated epidemiological impact(s) ex- 
pected from a given gene drive and release 
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strategy. This can be explored by incorporat- 
ing a gene drive model into a standard 
epidemiological modelling framework, for 
example susceptible-exposed-infectious— 
recovered (S-E-I-R) or a variety of extensions/ 
modifications as previously considered for 
Wolbachia (Ndii et al., 2015, 2016a,b; Zhang 
and Lui, 2020) or RIDL (Atkinson et al., 
2007) control approaches in Ae. aegypti mos- 
quitoes. This will likely require the use of a 
population dynamics model similar to those 
of Khamis et al. (2018) and Edgington 
and Alphey (2018) for two main reasons: 
(i) models must produce results for gene 
drive and epidemiological dynamics at all 
time points; and (ii) the respective sizes of 
human and insect (vector) populations are 
important in determining a pathogen’s 
force of infection. Such models can provide 
important insights into potential epidemio- 
logical impacts. However, various factors 
required to formulate these models (such as 
infection numbers, exact population sizes, 
transmissibility of pathogen(s) and biting 
frequency) can be extremely difficult to 
measure, meaning that the consideration of 
model uncertainty will be important when 
interpreting results. 

This chapter has focused on the use of 
mathematical modelling to predict the 
efficacy of UD gene drives from molecu- 
lar design and laboratory testing right 
through to field testing and final applica- 
tions. Such studies will likely be important 
in providing an evidential basis upon which 
regulatory decisions can be made. As further 


M.P. Edgington and L. Alphey 


laboratory-based testing provides more and 
higher-quality data, we would anticipate 
that more detailed and species-specific 
models will be developed, providing greater 
insight into the anticipated efficacy of UD 
systems. Likewise, as more field studies into 
the ecology of potential gene drive target 
species and field-trial releases (of this or 
other technologies) become available, more 
detailed ecological, epidemiological and 
behavioural factors can be studied and in- 
corporated into models, enabling the best 
possible predictions of gene drive function 
following release of transgenic insects. The 
previous literature and future focus areas 
discussed here demonstrate the key role 
that modelling plays in the development of 
gene drive technologies and emphasizes the 
necessity for gene drive research and 
development to follow an interdisciplinary 
approach. This will ensure that any future 
gene drive release has the greatest opportun- 
ity to function as intended, thus providing 
the maximum possible beneficial impact. 
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